智能论文笔记

Deep metric learning improves lab of origin prediction of genetically engineered plasmids

Igor M. Soares , Fernando H. F. Camargo , Adriano Marques , Oliver M. Crook

分类：机器学习 | 人工智能 | 神经与进化计算

2021-11-24

基因组工程正在进行前所未有的发展，现在已广泛可用。为确保负责任的生物技术创新并减少滥用工程DNA序列，为识别工程型质粒的起源实验室来说是至关重要的。基因工程归因（GEA），制定序列实验室协会的能力将支持这一过程中的法医专家。在这里，我们提出了一种基于度量学习的方法，该方法将最可能的原产实验室排名，同时为质粒序列和实验室产生嵌入。这些嵌入物可用于执行各种下游任务，例如聚类DNA序列和实验室，以及在机器学习模型中使用它们作为特征。我们的方法采用了循环转移增强方法，能够在前10个预测中正确地将原产于原产的90亿美元的时间排列 - 优于所有最新的最先进的方法。我们还证明我们可以使用只需10次\％$ 10 \％$ of序列进行几次拍摄学习并获得76±10美元的准确性。这意味着，我们仅使用第十个数据表达先前的CNN方法。我们还证明我们能够在特定实验室中提取质粒序列中的关键签名，允许对模型的产出进行可解释的检查。

translated by 谷歌翻译

GT-CausIn: a novel causal-based insight for traffic prediction

Ting Gao , Rodrigo Kappes Marques , Lei Yu

分类：机器学习

2022-12-12

Traffic forecasting is an important application of spatiotemporal series prediction. Among different methods, graph neural networks have achieved so far the most promising results, learning relations between graph nodes then becomes a crucial task. However, improvement space is very limited when these relations are learned in a node-to-node manner. The challenge stems from (1) obscure temporal dependencies between different stations, (2) difficulties in defining variables beyond the node level, and (3) no ready-made method to validate the learned relations. To confront these challenges, we define legitimate traffic causal variables to discover the causal relation inside the traffic network, which is carefully checked with statistic tools and case analysis. We then present a novel model named Graph Spatial-Temporal Network Based on Causal Insight (GT-CausIn), where prior learned causal information is integrated with graph diffusion layers and temporal convolutional network (TCN) layers. Experiments are carried out on two real-world traffic datasets: PEMS-BAY and METR-LA, which show that GT-CausIn significantly outperforms the state-of-the-art models on mid-term and long-term prediction.

translated by 谷歌翻译

Joint graph learning from Gaussian observations in the presence of hidden nodes

Samuel Rey , Madeline Navarro , Andrei Buciulea , Santiago Segarra , Antonio G. Marques

分类：机器学习

2022-12-04

Graph learning problems are typically approached by focusing on learning the topology of a single graph when signals from all nodes are available. However, many contemporary setups involve multiple related networks and, moreover, it is often the case that only a subset of nodes is observed while the rest remain hidden. Motivated by this, we propose a joint graph learning method that takes into account the presence of hidden (latent) variables. Intuitively, the presence of the hidden nodes renders the inference task ill-posed and challenging to solve, so we overcome this detrimental influence by harnessing the similarity of the estimated graphs. To that end, we assume that the observed signals are drawn from a Gaussian Markov random field with latent variables and we carefully model the graph similarity among hidden (latent) nodes. Then, we exploit the structure resulting from the previous considerations to propose a convex optimization problem that solves the joint graph learning task by providing a regularized maximum likelihood estimator. Finally, we compare the proposed algorithm with different baselines and evaluate its performance over synthetic and real-world graphs.

translated by 谷歌翻译

Privacy Meets Explainability: A Comprehensive Impact Benchmark

Saifullah Saifullah , Dominique Mercier , Adriano Lucieri , Andreas Dengel , Sheraz Ahmed

分类：机器学习 | 人工智能 | 计算机视觉

2022-11-08

Since the mid-10s, the era of Deep Learning (DL) has continued to this day, bringing forth new superlatives and innovations each year. Nevertheless, the speed with which these innovations translate into real applications lags behind this fast pace. Safety-critical applications, in particular, underlie strict regulatory and ethical requirements which need to be taken care of and are still active areas of debate. eXplainable AI (XAI) and privacy-preserving machine learning (PPML) are both crucial research fields, aiming at mitigating some of the drawbacks of prevailing data-hungry black-box models in DL. Despite brisk research activity in the respective fields, no attention has yet been paid to their interaction. This work is the first to investigate the impact of private learning techniques on generated explanations for DL-based models. In an extensive experimental analysis covering various image and time series datasets from multiple domains, as well as varying privacy techniques, XAI methods, and model architectures, the effects of private training on generated explanations are studied. The findings suggest non-negligible changes in explanations through the introduction of privacy. Apart from reporting individual effects of PPML on XAI, the paper gives clear recommendations for the choice of techniques in real applications. By unveiling the interdependencies of these pivotal technologies, this work is a first step towards overcoming the remaining hurdles for practically applicable AI in safety-critical domains.

translated by 谷歌翻译

Machine Learning for Improved Gas Network Models in Coordinated Energy Systems

Adriano Arrigo , Mihály Dolányi , Kenneth Bruninx , Jean-François Toubeau

分类：机器学习

2022-09-26

当前的能源转变促进了电力和天然气系统之间的运行融合。在这个方向上，改善协调能力和气体调度内的非凸天然气体流动动力学的建模至关重要。在这项工作中，我们提出了一种神经网络受限的优化方法，其中包括基于监督机器学习的韦茅斯方程的回归模型。 Weymouth方程将气体流动与每个管道的入口和出口压力通过二次平等，该二次相等性，该平等被神经网络捕获。后者是通过可处理的混合插入线性程序编码为约束集的。此外，我们提出的框架能够考虑双向性，而无需求助于复杂且可能不准确的凸化方法。我们通过引入激活函数的重新制定来进一步增强我们的模型，从而提高计算效率。一项基于现实生活中的比利时力量和气体系统的广泛数值研究表明，所提出的方法在准确性和障碍方面产生了有希望的结果。

translated by 谷歌翻译

Machine Learning guided high-throughput search of non-oxide garnets

Jonathan Schmidt , Haichen Wang , Georg Schmidt , Miguel Marques

分类：机器学习

2022-08-29

自从人类文明的早期阶段以来已知的石榴石在现代技术中发现了重要的应用，包括磁性限制，Spintronics，锂电池等。绝大多数实验性的石榴石是氧化物，而探索（实验或理论）在其余的探索中是氧化物化学空间的范围受到限制。一个关键问题是石榴石结构具有较大的原始单位单元格，需要大量的计算资源。为了对新石榴石的完整化学空间进行全面搜索，我们将图形神经网络中的最新进展与高通量计算结合在一起。我们应用机器学习模型来在系统密度功能的计算之前识别电势（meta-）稳定的石榴石系统以验证预测。通过这种方式，我们发现了600多个三元石榴石，距凸壳以下的凸壳距离低于100〜MEV/ATOM，具有各种物理和化学性质。这包括硫化物，氮化物和卤化物石榴石。为此，我们分析电子结构，并讨论电子带隙和电荷平衡的值之间的联系。

translated by 谷歌翻译

HTML版本

Pavlov Learning Machines

Elena Agliari , Miriam Aquaro , Adriano Barra , Alberto Fachechi , Chiara Marullo

分类： (统计)机器学习

2022-07-02

众所周知，HEBB的学习探索了帕夫洛夫的古典条件，而前者在过去几十年中进行了广泛的建模（例如，通过Hopfield模型和无数的主题变化），因为后者的建模在很大程度上保持了很大的含糊状态。远的;此外，完全缺乏这两个支柱之间的桥梁。实现该目标的主要困难置于所涉及的信息的本质上不同的范围：帕夫洛夫的理论是关于\ emph {concepts}之间的相关性（动态地）存储在突触矩阵中，这是由狗和一个戒指主演的著名实验所体现的钟;相反，HEBB的理论是关于相邻神经元对之间的相关性，如著名的陈述{\ em神经元一起发射汇合的}所总结。在本文中，我们依靠随机过程理论以及通过langevin方程进行神经和突触动力学模型，以证明 - 只要我们保持神经元和突触的时间表的大量分裂，Pavlov机制就会自发地发生并最终产生至恢复Hebbian内核的突触重量。

translated by 谷歌翻译

Revisiting the Shape-Bias of Deep Learning for Dermoscopic Skin Lesion Classification

Adriano Lucieri , Fabian Schmeisser , Christoph Peter Balada , Shoaib Ahmed Siddiqui , Andreas Dengel , Sheraz Ahmed

分类：计算机视觉 | 人工智能 | 机器学习

2022-06-13

人们普遍认为，人类视觉系统偏向于识别形状而不是纹理。这一假设导致了越来越多的工作，旨在使深层模型的决策过程与人类视野的基本特性保持一致。人们对形状特征的依赖主要预计会改善协变量转移下这些模型的鲁棒性。在本文中，我们重新审视了形状偏置对皮肤病变图像分类的重要性。我们的分析表明，不同的皮肤病变数据集对单个图像特征表现出不同的偏见。有趣的是，尽管深层提取器倾向于学习对皮肤病变分类的纠缠特征，但仍然可以从该纠缠的表示形式中解码单个特征。这表明这些功能仍在模型的学习嵌入空间中表示，但不用于分类。此外，不同数据集的光谱分析表明，与常见的视觉识别相反，皮肤皮肤病变分类本质上依赖于超出形状偏置的复杂特征组合。自然的结果，在某些情况下，摆脱了形状偏见模型的普遍欲望甚至可以改善皮肤病变分类器。

translated by 谷歌翻译

Self-Supervised Learning for Building Damage Assessment from Large-scale xBD Satellite Imagery Benchmark Datasets

Zaishuo Xia , Zelin Li , Yanbing Bai , Jinze Yu , Bruno Adriano

分类：计算机视觉 | 人工智能

2022-05-31

在灾难后评估领域，为了及时准确的救援和本地化，人们需要知道损坏的建筑物的位置。在深度学习中，一些学者提出了通过遥感图像进行自动且高度准确的建筑损害评估的方法，事实证明，这些方法比域专家评估更有效。但是，由于缺乏大量标记的数据，这些任务可能因能够进行准确的评估而遭受损失，因为深度学习模型的效率高度依赖于标记的数据。尽管现有的半监督和无监督研究在这一领域取得了突破，但它们都没有完全解决这个问题。因此，我们建议采用一种自制的比较学习方法来解决任务，而无需标记数据。我们构建了一个新颖的非对称双网络架构，并在XBD数据集上测试了其性能。我们模型的实验结果表明，与基线和常用方法相比，改善了。我们还展示了自我监督方法建立损害识别意识的潜力。

translated by 谷歌翻译

Mitigating Bias in Facial Analysis Systems by Incorporating Label Diversity

Camila Kolling , Victor Araujo , Adriano Veloso , Soraia Raupp Musse

分类：计算机视觉 | 人工智能

2022-04-13

面部分析模型越来越多地应用于对人们生活产生重大影响的现实应用中。但是，正如文献所表明的那样，自动对面部属性进行分类的模型可能会对受保护的群体表现出算法歧视行为，从而对个人和社会产生负面影响。因此，开发可以减轻面部分类器中意外偏见的技术至关重要。因此，在这项工作中，我们引入了一种新颖的学习方法，该方法将基于人类的主观标签和基于面部特征的数学定义的客观注释结合在一起。具体而言，我们从两个大规模的人类注销数据集中生成了新的客观注释，每个数据集都捕获了分析的面部特征的不同观点。然后，我们提出了一种合奏学习方法，该方法结合了接受不同类型注释的单个模型。我们对注释过程以及数据集分布提供了深入的分析。此外，我们从经验上证明，通过结合标签多样性，我们的方法成功地减轻了意外偏见，同时保持了下游任务的明显准确性。

translated by 谷歌翻译